a. With regard to the language, the international search was carried out on the basis of the international application in the
language in which it was filed, unless otherwise indicated under this item.
the statement that the subsequently furnished written sequence listing does not go beyond the disclosure in the
international application as filed has been furnished.

the statement that the information recorded in computer readable form is identical to the written sequence listing has been 
Certain claims were found unsearchable (see Box I).
Unity of invention is lacking (see Box II). 



4. With regard to the title,

[X] the text is approved as submitted by the applicant.

[ ] the text has been established by this Authority to read as follows:

5. With regard to the abstract,

[X] the text is approved as submitted by the applicant.

[ ] the text has been established, according to Rule 38.2(b), by this Authority as it appears in Box III. The applicant may,
within one month from the date of mailing of this international search report, submit comments to this Authority.

6. The figure of the drawings to be published with the abstract is Figure No.

[ ] as suggested by the applicant. [X] None of the figures.

[ ] because the applicant failed to suggest a figure.

[ ] because this figure better characterizes the invention.
I. Basis of the report

1. This report has been drawn on the basis of (substitute sheets which have been furnished to the receiving Office in
response to an invitation under Article 14 are referred to in this report as "originally filed" and are not annexed to
the report since they do not contain amendments.):



Description, pages:
1-7,9-13,16,17,19-122,124-131    as originally filed
8,14,15,18,123    as received on 21/06/2000 with letter of 19/06/2000

Claims, No.:
1-53    as received on 21/06/2000 with letter of 19/06/2000

Drawings, sheets:
1/9-9/9    as originally filed

2. The amendments have resulted in the cancellation of: 

□ the description, pages: 

□ the claims, Nos.: 

□ the drawings, sheets: 



3. [X] This report has been established as if (some of) the amendments had not been made, since they have been
considered to go beyond the disclosure as filed (Rule 70.2(c)):

see separate sheet 

4. Additional observations, if necessary: 
Inventive step (IS)
Claims
Claims
Claims
Claims


Yes: 


Claims 


No: 


Claims 



(where this subject-matter does not refer to claim 8 or 28)
1-7,9-2 (where this subject-matter does not refer to claim 8 or 28)
1-7,9-2 (where this subject-matter does not refer to claim 8 or 28)



2. Citations and explanations 
see separate sheet 



VIII. Certain observations on the international application 
1. Regarding Part I:

a. The references to the deposition number I-2049 and the date of deposition of
June 30, 1998 inserted into the amended description pages 8, 14, 15, 18 and 123
and into newly amended claims 8 and 28 have not been taken into account during
this examination. The requirements of R. 91.1(g)(ii) PCT have not been met since,
although it is clear that information in this respect has been omitted in error, it is
not clear precisely what information should be included in order to rectify the error.
Thus these pages and claims do not satisfy the requirements of Art. 19(2) PCT,
since they include additional information.

2. Regarding Part VIII, Art. 6 PCT: 

a. The designations of the BAC groups of claim 24 as Rv101 etc as recited in the 
claims are arbitrary and convey no meaning. This also applies to the 
embodiments of claim 25 and to the subject-matter of claim 30 with respect to the 
arbitrary designations used therein. 

b. Claim 11 refers to a purified polynucleotide of interest that has been isolated
according to the method of claim 9. Claim 11 is not characterised in terms of
technical features per se, only by reference to the way in which it has been
obtained, and is therefore unclear, since no technical distinction can be implied
with respect to the products derived from such methods. The above lack of clarity also
applies, mutatis mutandis, to claims 12, 17, 18, 21 (for a pair of
purified polynucleotides according to claim 11), 29 re. part (a), 31 re. parts (a)
and (b), 34 re. part (a), 36 re. part (a), 37 re. parts (a) and (c), 38 re. part (a), 39
re. part (a), 40 re. part (a), 47 and 48.

The lack of clear characterisation of claim 11 moreover implies a lack of novelty
per se (see observations regarding Part V below).

c. There is no further definition, either functional or otherwise of the polynucleotide 
which hybridises under stringent conditions in part (c) of claim 14. This is therefore 
indeterminate. This also applies, mutatis mutandis to part (c) of claim 20. 
d. The polynucleotide of claim 17 is also indeterminate, since the term "involved in
the pathogenicity of a Mycobacterium strain" is itself an indeterminate functional 
attribute which cannot be used to precisely define the polynucleotide in question. 
Claim 18 is also not a precise characterisation. The structure of all or part of a 
"Polymorphism Glycine Rich Sequence" is not apparent from the claim. 

e. Since there is no precise and unique art definition of what constitutes the size and 
structure of vectors termed "bacterial artificial chromosome vectors", which 
renders such vectors even without inserted DNA unique over others of the art, the 
subject-matter of claim 26 is not clear per se. The dependent claims 29 (re. claim 
26), 30, 36, 37, 39, 40, 41, 42, 49, 50 and 51 are thus also unclear.

3. Regarding Part V, Art. 33 PCT:

a. The subject-matter of the claims subject to examination are considered to enjoy 
the priority right of application US 09/060,756, filed 06/04/1998. 

Consequently, Brosch et al. (1998), Infection and Immunity, Vol. 66 pp. 2221-2229 
and also Cole et al. (1998) Nature Vol. 393 pp. 537-544 are currently not 
considered to be art for the purposes of Art. 33 PCT. 

b. Philipp et al. (1996) Microbiology Vol. 142 pp. 3135-3145 reports the physical
mapping of differences between the genome maps of M. bovis BCG and M.
tuberculosis H37Rv, which also include loci for the protein antigens ESAT-6 and
mpt64. The maps and comparisons were made using material, inter alia, from
genomic cosmid DNA libraries (see Summary and Methods). The
subject-matter of claim 26 differs from this art in the use of bacterial
artificial chromosome vectors to make up the DNA library.

Since BACs have been used as vectors for the creation of a number of genomic
DNA libraries (for examples, see the art cited in the passage of the current
description page 2, lines 10-12, which includes Kim et al. (1996) Genomics Vol.
34 pp. 213-218), the use of same to make genomic libraries of Mycobacterium
strains and species would be obvious, in view of their advantages in general over
cosmids. There would furthermore not appear to be any surprising effects and/or
advantages associated with such an implementation in the context of
Mycobacterium. Generic claim 26 thus lacks an inventive step under Art. 33(1)(3)
PCT.

The same applies mutatis mutandis with respect to the specific subject-matter of 
claims dependent on claim 26, since the skilled person would consider the use of 
BACs as the basis for any genomic DNA library based on Mycobacterium to be a 
logical option. The same also applies with respect to the subject-matter of claims 
22-25. 

d. Claim 1 is directed to a method for isolating a polynucleotide of interest that is 
present in the genome of a mycobacterium strain and/or is expressed by said 
mycobacterium strain and is absent or altered in the genome of a different 
mycobacterium strain and/or is not expressed in said different mycobacterium 
strain, said method comprising the use of at least one clone belonging to a 
genomic DNA library of a given Mycobacterium strain, said DNA library being 
cloned in a bacterial artificial chromosome (BAC) vector. 

This and the remaining claimed subject-matter are an extension and application of
comparative genomic studies using BAC vector based genomic libraries and the
use of these in order to identify polynucleotides and corresponding encoded
polypeptides in various known strains and spp. of Mycobacterium. The principle
of this technique has been applied in other fields, for example in the use of BACs
for the cloning of human genomic libraries. The identification of differing loci has
already been achieved in Philipp et al. (1996) supra. The skilled person would
consider the identification and isolation of polynucleotides and thereby encoded
polypeptides to be a logical extension of the prior art.
been deposited in the Collection Nationale de Cultures de Microorganismes
(CNCM) on June 30, 1998 under the accession number I-2049.

Thus, as a specific embodiment of the above described method for
isolating a polynucleotide of interest, said method makes use of at least one BAC-based
DNA library that has been constructed from the genomic DNA of
Mycobacterium tuberculosis, more specifically of the H37Rv strain, and
particularly of the DNA library deposited under the accession number I-1945.

In another specific embodiment of the above described method for
isolating a polynucleotide of interest, said method makes use of at least one BAC-based
DNA library that has been constructed from the genomic DNA of
Mycobacterium bovis BCG, more specifically of the Pasteur strain, and
particularly of the DNA library deposited under the accession number I-2049.

In more detail, the method according to the invention for isolating a
polynucleotide of interest may comprise the following steps:

a) isolating at least one polynucleotide contained in a clone of a BAC-based 
DNA library of mycobacterial origin; 

b) isolating : 

- at least one genomic or cDNA polynucleotide from a mycobacterium, said 
mycobacterium belonging to a strain different from the strain used to construct 
the BAC-based DNA library of step a); or alternatively 

- at least one polynucleotide contained in a clone of a BAC-based DNA library 
prepared from the genome of a mycobacterium that is different from the 
mycobacterium used to construct the BAC-based DNA library of step a); 

c) hybridizing the at least one polynucleotide of step a) to the at least one 
polynucleotide of step b); 

d) selecting the at least one polynucleotide of step a) that has not formed a 
hybrid complex with the at least one polynucleotide of step b); 

e) characterizing the selected polynucleotide. 
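
Steps a) to d) above amount to a subtractive selection, which can be illustrated in silico. The following is a minimal sketch only, not the laboratory procedure: it assumes that "forming a hybrid complex" can be crudely approximated by two sequences sharing at least one exact k-mer on either strand, and the function names and the default value of `k` are hypothetical choices made for illustration.

```python
from typing import Iterable, List

# Translation table for complementing an unambiguous DNA string.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def shares_kmer(probe: str, target: str, k: int) -> bool:
    """Crude stand-in for hybridization (an assumption of this sketch):
    True if any k-mer of `probe` occurs in `target` or in its reverse
    complement. A probe shorter than k never matches."""
    probe, target = probe.upper(), target.upper()
    target_rc = reverse_complement(target)
    return any(
        probe[i:i + k] in target or probe[i:i + k] in target_rc
        for i in range(len(probe) - k + 1)
    )


def select_unhybridized(step_a: Iterable[str],
                        step_b: Iterable[str],
                        k: int = 12) -> List[str]:
    """Analogue of step d): keep the step a) polynucleotides that match
    none of the step b) polynucleotides."""
    targets = list(step_b)
    return [p for p in step_a
            if not any(shares_kmer(p, t, k) for t in targets)]
```

In this toy model the returned fragments correspond to the polynucleotides carried forward to the characterization of step e); real hybridization depends on stringency, length and mismatches, none of which is modelled here.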

Following the above procedure, the at least one polynucleotide of step a) 
may be prepared as follows : 

1) digesting at least one recombinant BAC clone by an appropriate restriction
endonuclease in order to isolate the polynucleotide insert of interest from the
vector genetic material;

2) optionally amplifying the resulting polynucleotide insert; 
disease and elicits a variable antibody response, suggesting either that individuals
mount different immune responses or that this PGRS-protein may not be
produced in this form by all strains of M. tuberculosis. In other words, at least
some PE-PGRS coding sequences encode proteins that are involved in the
recognition of M. tuberculosis by the immune system of the infected host.
Therefore, differences in the PGRS sequences could represent the principal
source of antigenic variation in the otherwise genetically and antigenically
homogeneous bacterium.

By performing the method of the invention using the M. tuberculosis BAC-based
DNA library I-1945, the inventors have discovered the occurrence of
sequence differences between a given PGRS encoding ORF (ORF reference on
the genomic sequence of M. tuberculosis: Rv0746) of M. tuberculosis and its
counterpart sequence in the genome of M. bovis BCG.

More precisely, the inventors have determined that one ORF contained in
BAC vector N° Rv418 of the M. tuberculosis I-1945 DNA library carries
both base additions and base deletions when compared with the corresponding
ORF in the genome of M. bovis BCG that is contained in the BAC vector N°
X0175 of the M. bovis BCG I-2049 DNA library. The variations observed in the
base sequences correspond to variations in the C-terminal part of the amino acid
sequence of the PGRS ORF translation product.

As shown in Figure 6, an amino acid stretch of 29 residues in length is
present in this M. tuberculosis PGRS (ORF reference Rv0746) and is absent from
the ORF counterpart of M. bovis BCG, namely the following amino acid
sequence:

NH2-GGAGGAGGSSAGGGGAGGAGGAGGWLLGD-COOH.

Furthermore, Figure 6 also shows that an amino acid stretch of 45 residues in
length is absent from this M. tuberculosis PGRS and is present in the ORF
counterpart of M. bovis BCG, namely the following amino acid sequence:

NH2-GAGGIGGIGGNANGGAGGNGGTGGQLWGSGGAGVEGGAAL
SVGDT-COOH.
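
The lengths of the two stretches can be checked directly from the sequences as printed. The short sketch below does only that; the helper name `strip_termini` is a hypothetical choice, and the sequences are copied from the text above with the NH2- and -COOH terminal labels removed.

```python
from collections import Counter


def strip_termini(peptide: str) -> str:
    """Remove the NH2- / -COOH labels, any trailing period and any
    whitespace or line breaks, leaving only one-letter residue codes."""
    cleaned = "".join(peptide.split()).rstrip(".")
    if cleaned.startswith("NH2-"):
        cleaned = cleaned[len("NH2-"):]
    if cleaned.endswith("-COOH"):
        cleaned = cleaned[:-len("-COOH")]
    return cleaned


# Stretch present in the M. tuberculosis PGRS (Rv0746), absent from BCG.
mtb_only = strip_termini("NH2-GGAGGAGGSSAGGGGAGGAGGAGGWLLGD-COOH.")

# Stretch present in the M. bovis BCG counterpart, absent from M. tuberculosis.
bcg_only = strip_termini(
    "NH2-GAGGIGGIGGNANGGAGGNGGTGGQLWGSGGAGVEGGAAL\nSVGDT-COOH."
)

print(len(mtb_only), len(bcg_only))          # residue counts
print(Counter(mtb_only).most_common(3))      # most frequent residues
print(Counter(bcg_only).most_common(3))
```

Counting in this way gives 29 residues for the first stretch and 45 for the second, both dominated by glycine, as expected for PGRS sequences.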

Similar observations were made with PPE ORF Rv0442, which showed a
5 codon deletion relative to the M. bovis amino acid sequence.

Given that the polymorphism associated with the PE-PGRS or PPE ORFs
resulted in extensive antigenic variability or reduced antigen presentation, this
would be of immense significance for vaccine design and for understanding
protective immunity in tuberculosis and could, possibly, explain the varied responses
seen in different BCG vaccination programmes.

There are several striking parallels between the PGRS proteins and the 
Epstein-Barr virus-encoded nuclear antigens (EBNA). Both polypeptide families 
are glycine-rich, contain Gly-Ala repeats that represent more than one third of the 
molecule, and display variation in the length of the repeat region between 
different isolates. The Gly-Ala repeat region of EBNA1 has been shown to
function as a cis-acting inhibitor of antigen processing and MHC class I-restricted
antigen presentation (Levitskaya et al., 1995). The fact that MHC class
I knock-out mice are extremely susceptible to M. tuberculosis underlines the
importance of MHC class I antigen presentation in protection against
tuberculosis. Therefore, it is possible that the PE/PPE protein family also plays
some role in inhibiting antigen presentation, allowing the bacillus to hide from
the host's immune system.

As such, the novel and nonobvious PGRS polynucleotide from M. bovis
which is homologous to the M. tuberculosis ORF Rv0746, and which is contained in
the BAC clone N° X0175 (see Table 4 for SP6 and T7 end-sequences of clone
N° X0175) of the I-2049 M. bovis BCG BAC DNA library, is part of the present
invention, as it represents a starting material in order to define specific probes or
primers useful for detection of antigenic variability in mycobacterial strains and of
possible inhibition of antigen processing, as well as to differentiate M.
tuberculosis from M. bovis BCG.

Thus, a further object of the invention consists in a polynucleotide 
comprising the sequence SEQ ID N°4. 

Polynucleotides of interest have been defined by the inventors as useful
detection tools in order to differentiate M. tuberculosis from M. bovis BCG. Such
polynucleotides are contained in the coding sequence, 45 amino acids in length, that is
present in M. bovis BCG but absent from M. tuberculosis. This polynucleotide
has a sequence beginning (5' end) at the nucleotide at position nt 729 of the
sequence SEQ ID N°4 and ending (3' end) at the nucleotide at position nt 863 of
the sequence SEQ ID N°4.
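
The coordinates given above are consistent with a 45 amino acid coding sequence: positions nt 729 to nt 863, counted inclusively, span 135 nucleotides, that is 45 codons. The sketch below works through this arithmetic and shows how such a region would be cut out of a sequence string using 1-based inclusive positions. The sequence SEQ ID N°4 is not reproduced in this passage, so the example uses a placeholder string; the function names are hypothetical.

```python
def region_length(start_nt: int, end_nt: int) -> int:
    """Length of a region given 1-based, inclusive start and end positions."""
    if start_nt < 1 or end_nt < start_nt:
        raise ValueError("positions must satisfy 1 <= start <= end")
    return end_nt - start_nt + 1


def extract_region(seq: str, start_nt: int, end_nt: int) -> str:
    """Return seq[start_nt..end_nt] using 1-based inclusive coordinates."""
    if end_nt > len(seq):
        raise ValueError("end position lies beyond the sequence")
    region_length(start_nt, end_nt)  # validates the coordinates
    return seq[start_nt - 1:end_nt]


length_nt = region_length(729, 863)
codons, remainder = divmod(length_nt, 3)
print(length_nt, codons, remainder)

# Placeholder standing in for SEQ ID N°4 (NOT the real sequence).
placeholder = "ACGT" * 250  # 1000 nt
fragment = extract_region(placeholder, 729, 863)
print(len(fragment))
```

The region length is a whole number of codons (remainder 0), in agreement with the 45-residue stretch described for M. bovis BCG.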

Thus, part of the present invention is also a polynucleotide which is 
chosen among the following group of polynucleotides : 

a) a polynucleotide comprising at least 8 consecutive nucleotides of the
nucleotide sequence SEQ ID N°5 ;
Rv106; Rv39; Rv255; Rv74; Rv355; Rv268; Rv58; Rv173; Rv264; Rv417;
Rv401; Rv144; Rv302; Rv81; Rv163; Rv281; Rv221; Rv420; Rv175; Rv86;
Rv412; Rv73; Rv269; Rv214; Rv287; Rv42; Rv143.

The polynucleotides disclosed in Table 3 may be used as probes in order
to select a given clone of the BAC DNA library I-1945 for further use.

The invention also provides for a BAC-based Mycobacterium bovis strain
Pasteur genomic DNA library that has been deposited in the Collection Nationale
de Cultures de Microorganismes on June 30, 1998 under the accession number
I-2049.

A further object of the invention consists in a recombinant BAC vector
which is chosen among the group consisting of the recombinant BAC vectors
belonging to the BAC-based DNA library I-2049. This DNA library contains
approximately 1600 clones. The average insert size is estimated to be ~80 kb.
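
The two figures quoted above allow a rough estimate of how much DNA the library holds. The sketch below multiplies clone number by mean insert size and, under a stated assumption, converts the result into genome equivalents. The genome size of about 4,400 kb used here is an assumption introduced for illustration and is not taken from this document; the function names are likewise hypothetical, and the representation estimate treats clones as independent and uniformly distributed, which a real library only approximates.

```python
def total_cloned_kb(n_clones: int, mean_insert_kb: float) -> float:
    """Total cloned DNA in kb: number of clones times mean insert size."""
    return n_clones * mean_insert_kb


def genome_equivalents(n_clones: int, mean_insert_kb: float,
                       genome_kb: float) -> float:
    """Library redundancy: total cloned DNA divided by genome size."""
    return total_cloned_kb(n_clones, mean_insert_kb) / genome_kb


def probability_locus_represented(n_clones: int, mean_insert_kb: float,
                                  genome_kb: float) -> float:
    """Chance that a given locus is in at least one clone, assuming
    independent, uniformly distributed inserts: 1 - (1 - f)^N with
    f = insert size / genome size."""
    f = mean_insert_kb / genome_kb
    return 1.0 - (1.0 - f) ** n_clones


ASSUMED_GENOME_KB = 4400.0  # assumption, not stated in the source text

print(total_cloned_kb(1600, 80))                        # kb of cloned DNA
print(genome_equivalents(1600, 80, ASSUMED_GENOME_KB))  # fold redundancy
print(probability_locus_represented(1600, 80, ASSUMED_GENOME_KB))
```

With 1600 clones of about 80 kb the library holds about 128,000 kb of cloned DNA; how many genome equivalents this represents depends entirely on the genome size assumed.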

Generally, a recombinant BAC vector of interest may be chosen among
the following set or group of BAC vectors contained in the BAC-based DNA
library I-2049 :

X0001; X0002; X0003; X0004; X0006; X0007; X0008; X0009; X0010; X0012;
X0013; X0014; X0015; X0016; X0017; X0018; X0019; X0020; X0021; X0175.

The end sequences of the polynucleotide inserts of each of the above 
clones corresponding respectively to the sequences adjacent to the T7 promoter 
and to the Sp6 promoter on the BAC vector are shown in Table 4. 

The polynucleotides disclosed in Table 4 may be used as probes in order
to select a given clone of the BAC DNA library I-2049 for further use.

Also part of the invention are the polynucleotide inserts that are contained
in the above described BAC vectors and that are useful as primers or probes.

These polynucleotides and nucleic acid fragments may be used as primers 
for use in amplification reactions, or as nucleic probes. 

PCR is described in the US patent N° 4,683,202. The amplified fragments 
may be identified by an agarose or a polyacrylamide gel electrophoresis, or by a 
capillary electrophoresis or alternatively by a chromatography technique (gel 
filtration, hydrophobic chromatography or ion exchange chromatography). The 
specificity of the amplification may be ensured by a molecular hybridization 
using, for example, one of the initial primers as nucleic probes. 

Amplified nucleotide fragments are used as probes in hybridization
reactions in order to detect the presence of one polynucleotide according to the
Table 4 : End-sequences of the polynucleotide inserts cloned in the named recombinant
BAC vectors contained in the I-2049 M. bovis strain Pasteur genomic DNA library.

RvXXXSP6 corresponds to the SP6 end-sequence of the clone RvXXX.

RvXXXT7 corresponds to the T7 end-sequence of the clone RvXXX.

RvXXXIS1081 corresponds to a region located close to a copy of the IS1081 repetitive
sequence (insertion element).

The character « - » denotes an uncertain base residue. 



Clone X0001 

::::::::::::::: : X0001SP6 . seq: ::::::::::::: 
AAG- 

TCGGGTTTCCACACGCGCGGTTTGACCCTAGTCATATGTAATCATGTGTACCATGTGCGGGCGCTTTTCGACGGCCG 

CGAACCACCGGA--ATTTCCTGTGATTTCACTGCATGCGTACCATCTGGCACAATTGAGCA-TTGTCT- 

TCGCGGTGGTCGG-CGGGTTGCGTGCCGCCTGCTGCGA-ATGCACCA- 

TAAGCCCGAACCCACCGGCTTGGTGACCACCGCACGCTGCGTGTGGGGGGTAACCACTCCGCGACCCCAAGGATGGT 
CATTTCCAATGAACCGGCTGGACTTCGTCCA-A 

::::::::::::: : X0001T7 . seq: ::::::::::::: 

GTCGCGGTTCGATCGACCCGATCTTCACCTCGTAACCTCGATGCTTAGCAGGATCCAGCTTGACCGCGTTTGGCTCT 
ACCCACTCTTTGAGTGGCGCCGTCGCCTGTGCCCCATCGGTGTTCATGACGAACGCTTCGAAAGACTTCCTCTTGTG 
AGCCGGAATGTCTGCGTAAAGAAGTTCCATGTCCGGGAAGTAGACCCGGTCGCCCTCCACGTGGTACTCCTTCGAGG 
TCCGCTTCTCGCCGGATCCGATAAACACCGGCCCCAGGCACCGCAGCGTGAGTTCGAACGGCTTCAGGTAGGTGTTC 
ATGCGGCGGACTCCGGGAGTGCGAGAAATAGCGGTCGCGCGTAGCTGTAGACCGGATGGTTTCCGCCCAGGCTGACG 
TCGAAGATGCCTCCTTGGAAGGGGCGCGA 



Clone X0002 

::::::::::::: : X0002SP6 . seq: ::::::::::::: 

AACTCAAGTTTTTACGGTGATCGCGCATCACCTGGTTCATGAACTGGAAGCAGCGCAGCGCTTCCTTTTCGGCCGCA 
ACATGAGCCAGCCTCTCGTCGGCGGTCGGGTGCAGGTGCTCGGGCAGCTCGGCCGCGACAGCCGCCTGACCCTGAAA 
CCAGCTTCCATATCCCGCGAC--

AACGACGCCAGTCCGCTACGTAACCCCTCCGCGACTGTCCTVTGGACAACAGCGCGTTCTCC^CCGACCGGGCCCGGG 
TGT 



::::::::::::: :X0002T7 . seq: ::::::::::::: 

GTGCAGGTTTCGAC^TGTGGTGCCGGTTCGGCGGCTACGTGCCATCGAGAC^CTGGCGC^-GCTATCGC^CCCGTT 
ATCGGCTGCGAGCAAATCGCGGTATGCGTTCTTGAGCATGAGTCGGCGACCGTCGTCATGGTCGACACCCACGACGG
AAAGACGCAGATCGCCGTCAAGCATGTGTGCCGCGGATTATCAGGACTGACCTCCTGGCTGACCGGCATGTTTGGTC 
GCGATGCCTGGCGCCCGGCCGGCGTGGTCGTGGTCGGCTCGGATAGCGAGGTCAGCGAATTCTCGTGGCAGCTCGAA 
AGGGTCCTGCCGGTGCCGGT



Clone X0003 

::::::::::::: :X0003SP6 . seq: ::::::::::::: 
TTCGAGTCATGCGCCCGCCTCGACCACGAA-ATGCACGTCG- 

GGTTCGATCGACCCGATCTTCACCTCGTAACCTCGATGCTTAGCAGGATCCAGCTTGACCGCGTTTGGCTCTACCCA 
CTCTTTGAGTGGCGCCGTCGCCTGTGCCCCATCGGTGTTCATGACGAACGCTTCGAAAGACTTCCTCTTGTGAGCCG 
GAATGTCTGCGTAAAGAAGTTCCATGTCCGGGAAGTAGACCCGGTCGCCCTCCACGTGGTACTCCTTCGAGGTCCGC 
TTCTC 
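
The end-sequence records listed above follow a simple layout: a header line naming the read (for example X0001SP6) between runs of colons, followed by one or more lines of sequence in which « - » marks an uncertain base. The following sketch shows one way such records could be read into a dictionary and the proportion of uncertain bases computed. The regular expression and function names are hypothetical and were written against the layout as printed here, not against any published file format.

```python
import re
from typing import Dict

# Header such as ":::::: : X0001SP6 . seq: ::::::" (spacing varies).
HEADER = re.compile(r"^:+\s*:?\s*([A-Za-z0-9]+)\s*\.\s*seq:\s*:+\s*$")


def parse_end_sequences(text: str) -> Dict[str, str]:
    """Map each read name (e.g. 'X0001SP6') to its concatenated sequence.
    Blank lines and 'Clone ...' title lines are skipped; whitespace inside
    sequence lines is removed."""
    records: Dict[str, str] = {}
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("Clone"):
            continue
        match = HEADER.match(line)
        if match:
            current = match.group(1)
            records[current] = ""
        elif current is not None:
            records[current] += "".join(line.split())
    return records


def uncertain_fraction(seq: str) -> float:
    """Fraction of positions marked '-' (uncertain base residue)."""
    return seq.count("-") / len(seq) if seq else 0.0


sample = """
Clone X0001

::::::::::::::: : X0001SP6 . seq: :::::::::::::
AAG-
TCGG--AT

::::::::::::: : X0001T7 . seq: :::::::::::::
GTCG CGGT
"""

reads = parse_end_sequences(sample)
for name, seq in reads.items():
    print(name, len(seq), uncertain_fraction(seq))
```

The sample text is an abbreviated, made-up record in the same layout; applying the parser to the full table would additionally require deciding how to treat characters other than A, C, G, T and « - » that appear in some reads.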
1. A method for isolating a polynucleotide of interest that is present in the
genome of a mycobacterium strain and/or is expressed by said mycobacterium
strain and that is absent or altered in the genome of a different mycobacterium
strain and/or is not expressed in said different mycobacterium strain, said method
comprising the use of at least one clone belonging to a genomic DNA library of a
given mycobacterium strain, said DNA library being cloned in a bacterial artificial
chromosome (BAC) vector.

2. The method according to claim 1, wherein the BAC-based DNA library
has been constructed from genomic DNA of Mycobacterium tuberculosis.

3. The method according to claim 2, wherein the BAC-based DNA library
has been constructed from genomic DNA of Mycobacterium tuberculosis strain
H37Rv.

4. The method according to claim 3, wherein the BAC-based DNA library
has been deposited in the Collection Nationale de Cultures de Microorganismes
(CNCM) on November 19, 1997 under the accession number I-1945.

5. The method according to claim 1, wherein the BAC-based DNA library 
has been constructed from genomic DNA of Mycobacterium bovis. 

6. The method according to claim 5, wherein the BAC-based DNA library
has been constructed from the genomic DNA of Mycobacterium bovis BCG
strain Pasteur.

7. The method according to claim 6, wherein said DNA library contains
approximately 1600 clones and wherein the genomic DNA is cloned into a
recombinant pBeloBAC11 vector with an average insert size of approximately

8. The method according to claim 6 or 7, wherein the at least one BAC-based
DNA library has been deposited in the Collection Nationale de Cultures de
Microorganismes (CNCM) on June 30, 1998 under the accession number I-2049.
9. A method of isolating a polynucleotide of interest that is present in a
genome of a first mycobacterium strain or that is expressed by the first
mycobacterium strain and that is absent or altered in a genome of a second
mycobacterium strain or that is not expressed by the second mycobacterium
strain, said method comprising :

a) providing at least one polynucleotide contained in a clone of a bacterial
artificial chromosome (BAC) DNA library of the first mycobacterium strain;

b) providing at least one genomic or cDNA polynucleotide from a second
mycobacterium strain that is different from the first mycobacterium strain or at
least one polynucleotide contained in a clone of a BAC DNA library prepared
from the genome of the second mycobacterium strain;

c) contacting under hybridizing conditions the polynucleotide of step a) with the
polynucleotide of step b); and

d) isolating the polynucleotide of step a) that has not formed a hybrid complex
with the polynucleotide of step b).

10. The method of claim 9, wherein the polynucleotide contained in a 
clone of a BAC DNA library of the first or second mycobacterium strain is 
prepared by the following procedure : 

1) digesting at least one recombinant BAC clone by an appropriate restriction
endonuclease to yield a polynucleotide insert of interest; and

2) isolating the polynucleotide insert of interest. 

11. A purified polynucleotide of interest that has been isolated according
to the method of claim 9.

12. The purified polynucleotide of claim 11 which contains at least one
Open Reading Frame (ORF).

13. The purified polynucleotide of claim 12, which is SEQ ID NO:1.

14. The purified polynucleotide of claim 12, wherein said polynucleotide 
is selected from the group consisting of : 



AMENDED SHEET 21-06-2000 IB 009900740



a) a polynucleotide comprising at least 8 consecutive nucleotides of SEQ ID
NO:1;

b) a polynucleotide having a sequence fully complementary to SEQ ID NO:1; and

c) a polynucleotide that hybridizes under stringent hybridization conditions with
the polynucleotide defined in a) or with the polynucleotide defined in b).

15. The purified polynucleotide of claim 14, which is SEQ ID NO:2.

16. The purified polynucleotide of claim 14, which is SEQ ID NO:3.

17. The purified polynucleotide of claim 12, wherein the ORF encodes all 
or part of a polypeptide involved in the pathogenicity of a mycobacterium strain. 

18. The purified polynucleotide of claim 12, wherein the ORF encodes all 
or part of a Polymorphism Glycine Rich Sequence (PGRS). 

19. The purified polynucleotide of claim 18, which is SEQ ID NO:4.

20. The purified polynucleotide of claim 18, which is selected from the
group consisting of:

a) a polynucleotide comprising at least 8 consecutive nucleotides of SEQ ID
NO:5 ;

b) a polynucleotide having a sequence that is fully complementary to SEQ ID
NO:5;

c) a polynucleotide that hybridizes under stringent hybridization conditions with
the polynucleotide defined in a) or with the polynucleotide defined in b).

21. A pair of the purified polynucleotides as claimed in claim 11.

22. A Mycobacterium tuberculosis strain H37Rv genomic DNA library that
has been deposited in the Collection Nationale de Cultures de Microorganismes
under accession number I-1945, wherein said genomic DNA library comprises
recombinant bacterial artificial chromosome vectors.

23. A recombinant bacterial artificial chromosome (BAC) vector, which 
belongs to the genomic DNA library of claim 22. 

24. The recombinant BAC vector of claim 23, which is selected from the 
group consisting of : 
Rv101; Rv102; Rv103; Rv104; Rv105; Rv106; Rv107; Rv108; Rv109; Rv10;
Rv110; Rv111; Rv112; Rv113; Rv114; Rv115; Rv116; Rv117; Rv118; Rv119;
Rv11; Rv120; Rv121; Rv122; Rv123; Rv124; Rvm; Rv12?; Rv128; Rv129;
Rv130; Rv132; Rv134; Rv135; Rv136; Rv137; Rv138; Rv139; Rv13; Rv140;
Rv141; Rv142; Rv143; Rv144; Rv145; Rv146; Rv147; Rv148; Rv149; Rv14;
Rv150; Rv151; Rv152; Rv153; Rv154; Rv155; Rv156; Rv157; Rv159; Rv15;
Rv160; Rv161; Rv162; Rv163; Rv164; Rv165; Rv166; Rv167; Rv169; Rv16;
Rv170; Rv171; Rv172; Rv173; Rv174; Rv175; Rv176; Rv177; Rv178; Rv179;
Rv17; Rv180; Rv181; Rv182; Rv183; Rv184; Rv185; Rv186; Rv187; Rv188;
Rv18; Rv190; Rv191; Rv192; Rv193; Rv194; Rv195; Rv196; Rv19; Rv1; Rv201;
Rv204; Rv205; Rv207; Rv209; Rv20; Rv214; Rv215; Rv217; Rv218; Rv219;
Rv21; Rv220; Rv221; Rv222; Rv223; Rv224; Rv225; Rv226; Rv227; Rv228;
Rv229; Rv22; Rv230; Rv231; Rv232; Rv233; Rv234; Rv235; Rv237; Rv240;
Rv241; Rv243; Rv244; Rv245; Rv246; Rv247; Rv249; Rv24; Rv251; Rv252;
Rv253; Rv254; Rv255; Rv257; Rv258; Rv259; Rv25; Rv260; Rv261; Rv262;
Rv263; Rv264; Rv265; Rv266; Rv267; Rv268; Rv269; Rv26; Rv270; Rv271;
Rv272; Rv273; Rv274; Rv275; Rv276; Rv277; Rv278; Rv279; Rv27; Rv280;
Rv281; Rv282; Rv283; Rv284; Rv285; Rv286; Rv287; Rv288; Rv289; Rv28;
Rv290; Rv291; Rv292; Rv293; Rv294; Rv295; Rv296; Rv29; Rv2; Rv301;
Rv302; Rv303; Rv304; Rv306; Rv307; Rv308; Rv309; Rv30; Rv310; Rv311;
Rv312; Rv313; Rv314; Rv315; Rv316; Rv317; Rv318; Rv319; Rv31; Rv32-
Rv322; Rv327; Rv328; Rv329; Rv32; Rv330; Rv331; Rv333; Rv334; Rv335;
Rv336; Rv337; Rv338; Rv339; Rv33; Rv340; Rv341; Rv343; Rv344; Rv346;
Rv347; Rv348; Rv349; Rv34; Rv350; Rv351; Rv352; Rv353; Rv354; Rv355;
Rv356; Rv357; Rv358; Rv359; Rv35; Rv360; Rv361; Rv363; Rv364; Rv365;
Rv366; Rv367; Rv368; Rv369; Rv36; Rv370; Rv371; Rv373; Rv374; Rv375;
Rv376; Rv377; Rv378; Rv379; Rv37; Rv381; Rv382; Rv383; Rv384; Rv385;
Rv386; Rv387; Rv388; Rv389; Rv38; Rv390; Rv391; Rv392; Rv393; Rv396;
Rv39; Rv3; Rv40; Rv412; Rv413; Rv414; Rv415; Rv416; Rv417; Rv418; Rv419;
Rv41; Rv42; Rv43; Rv44; Rv45; Rv46; Rv47; Rv48; Rv49; Rv4; Rv50; Rv51;
Rv52; Rv54; Rv55; Rv56; Rv57; Rv58; Rv59; Ry ■ ^

Rv63; Rv64; Rv65; Rv66; Rv67; Rv68; Rv69; Rv6; Rv70; Rv71; Rv72; Rv73

Rv»fi' rv76; rv7?; Rv?8; ™ ; *-= ~* rvs, s S 

' RV8?; RV88; RV89; RV8; ^ «^ Rv96 

RSitaR^ RV35; RV415; RV4M; Rv209; R ^ 

RV233, Rb 38; Rv280; Rvl77 ; Rv48; Rv374; Rvl51; Rv238; Rvl56- Rv92- Rv3 
****** Rv243; RV330; Rv285 ; Rv2 3 3; R v219; Rv416; R ^ 

z r R r R ; Rv87; Rv273: rv266; rv25; RvBs; — * «* «i ^ 
*r R r; Rv2i5; Rv329; Rv24o; Rvi9: Rv?4; r ^ ^ 

TIT! ' RV3B; RV3 ° 8; RV22 ° ; R ^ Rv.2! 

5£ » r R ; Rv44: Rv7i; rv?; rv2?; Rvi9i; ^ *«7 

Rv106; Rv39; Rv255; Rv74; Rv355; Rv268; Rv58; Rv173; Rv264; Rv417;
Rv401; Rv144; Rv302; Rv81; Rv163; Rv281; Rv221; Rv420; Rv175; Rv86;
Rv412; Rv73; Rv269; Rv214; Rv287; Rv42 and Rv143.

26. A Mycobacterium bovis BCG strain Pasteur genomic DNA library
wherein said genomic DNA library comprises recombinant bacterial artificial
chromosome vectors.

27. A Mycobacterium bovis BCG strain Pasteur genomic DNA library
according to claim 26, wherein said DNA library contains approximately 1600
clones and wherein the genomic DNA is cloned into a recombinant pBeloBAC11
vector with an average insert size of approximately 80 kb.

28. A Mycobacterium bovis BCG strain Pasteur genomic DNA library
according to claim 26, that has been deposited in the Collection Nationale de
Cultures de Microorganismes (CNCM) on June 30, 1998 under the accession
number I-2049.

29. A recombinant bacterial artificial chromosome (BAC) vector, which 
belongs to the genomic DNA library of claims 26 to 28. 
30. A recombinant BAC vector according to claim 29, which is selected
from the group consisting of:

X0001; X0002; X0003; X0004; X0006; X0007; X0008; X0009; X0010; X0012; 
X0013; X0014; X0015; X0016; X0017; X0018; X0019; X0020; X0021 and 
X0175. 

31. A method for detecting a mycobacterial nucleic acid in a biological
sample comprising the steps of:

a) contacting the recombinant BAC vector according to claim 23 or 29, or a 
purified polynucleotide according to claim 11 with the mycobacterial nucleic 
acid in the biological sample ; and 
b) detecting a hybrid nucleic acid molecule formed between said recombinant
BAC vector or said purified polynucleotide and the mycobacterial nucleic acid 
in the biological sample. 

32. The method of claim 31, further comprising before step a), making the 
mycobacterial nucleic acid in the biological sample available to a hybridization
reaction.

33. A method for detecting mycobacterial nucleic acid in a biological 
sample comprising the steps of: 

a) contacting a first polynucleotide according to claim 11 that has been 
immobilized onto a substrate with the mycobacterial nucleic acid in the
biological sample; and

b) contacting a hybrid nucleic acid molecule formed between said first 
polynucleotide and the mycobacterial nucleic acid in the biological sample 
with a second, labeled polynucleotide according to claim 11, wherein said 
second polynucleotide and said first polynucleotide have non-overlapping
sequences.

34. The method of claim 33, further comprising before step a), making the
mycobacterial nucleic acid in the biological sample available to a hybridization
reaction.

35. The method of claim 33 or 34, further comprising before step b)
removing the mycobacterial nucleic acid that is not hybridized with the
immobilized first polynucleotide.

36. A method for detecting mycobacterial nucleic acid in a biological 
sample comprising the steps of: 

a) contacting the mycobacterial nucleic acid in the biological sample with a pair 
of purified polynucleotides according to claim 21;

b) amplifying said mycobacterial nucleic acid; and

c) detecting the amplified mycobacterial nucleic acid. 

37. The method of claim 36, further comprising before step a), making the 
mycobacterial nucleic acid in the biological sample available to a hybridization 
reaction. 

38. A kit for detecting a mycobacterium in a biological sample comprising:

a) a recombinant BAC vector according to claim 23 or 29, or a purified
polynucleotide according to claim 11; and

b) reagents necessary to perform a nucleic acid hybridization reaction. 

39. A kit for detecting a mycobacterium in a biological sample comprising:

a) a recombinant BAC vector according to claim 23 or 29, or a first
polynucleotide according to claim 11 that is immobilized onto a substrate;

b) reagents necessary to perform a nucleic acid hybridization reaction; and

c) a second polynucleotide according to claim 11, wherein said second
polynucleotide is radioactively or non-radioactively labeled, and wherein said
second polynucleotide and said first polynucleotide have non-overlapping
sequences.
40. A kit for detecting a mycobacterium in a biological sample comprising:

a) a pair of purified polynucleotides according to claim 20; and

b) reagents necessary to perform a nucleic acid amplification reaction. 

41. A method for detecting the presence of a genomic DNA, a cDNA or a
mRNA of a mycobacterium in a biological sample, comprising the steps of:

a) contacting the biological sample with a plurality of BAC vectors according to 
claim 23 or 29, or purified polynucleotides according to claim 11 that are 
immobilized on a substrate; and

b) detecting the hybrid complexes formed. 

42. A kit for detecting a genomic DNA, a cDNA or a mRNA of a
mycobacterium in a biological sample, comprising:

a) a substrate on which a plurality of BAC vectors according to claim 23 or 29,
or purified polynucleotides according to claim 11 have been immobilized.

43. A method for detecting a polynucleotide of mycobacterial origin in a
biological sample, said method comprising:

a) aligning at least one polynucleotide contained in a recombinant BAC vector
according to claim 23 or 29 on the surface of a substrate;

b) contacting the polynucleotide in the biological sample with the substrate on
which the polynucleotide of step a) has been aligned; and

c) detecting a hybrid nucleic acid molecule formed between the polynucleotide in
the biological sample and the aligned polynucleotide of step a).

44. A kit for detecting a polynucleotide of mycobacterial origin in a 
biological sample, comprising: 

a) a substrate on which at least one polynucleotide contained in a recombinant
BAC vector according to claim 23 or 29 has been aligned.

45. The method of claim 10, wherein the procedure by which the 
polynucleotide contained in a clone of a BAC DNA library is prepared, further 
comprises amplifying the polynucleotide insert. 
46. The method of claim 10, wherein the procedure by which the
polynucleotide contained in a clone of a BAC DNA library is prepared, further 
comprises digesting the polynucleotide insert with at least one restriction 
endonuclease. 

47. The method of claim 45, further comprising digesting the amplified
polynucleotide insert with at least one restriction endonuclease.

48. The polynucleotide of claim 17, wherein the mycobacterium strain is
Mycobacterium tuberculosis.

49. The method of claim 36, wherein the amplified mycobacterial DNA is
detected by gel electrophoresis or with a labeled polynucleotide according to
claim 11.

50. The kit of claim 40, further comprising a polynucleotide according to 
claim 11. 

51. The kit of claim 42, further comprising reagents necessary to perform a
hybridization reaction.

52. A method for physically mapping a polynucleotide of mycobacterial 
origin in a biological sample, said method comprising: 

a) aligning at least one polynucleotide contained in a recombinant BAC vector 
according to claim 23 or 29 on the surface of a substrate; 
b) contacting the polynucleotide in the biological sample with the substrate on
which the polynucleotide of step a) has been aligned under hybridizing 
conditions; and 

c) detecting the location of the hybridized polynucleotide from the biological 
sample. 

53. The kit of claim 44, further comprising reagents necessary for labeling
DNA and reagents necessary for performing a hybridization reaction.